Text Analytics
An Efficient Classification Model for Cyber Text
Hossen, Md Sakhawat, Borshon, Md. Zashid Iqbal, Badrudduza, A. S. M.
The rise of deep learning methodology and practice in recent years has brought a severe consequence: a growing carbon footprint driven by an insatiable demand for computational resources and power. Text analytics has likewise undergone a massive transformation under this dominant methodology. In this paper, the original TF-IDF algorithm is modified, and Clement Term Frequency-Inverse Document Frequency (CTF-IDF) is proposed for data preprocessing. The paper primarily discusses the effectiveness of classical machine learning techniques in text analytics when combined with CTF-IDF and the faster IRLBA algorithm for dimensionality reduction. Introducing both techniques into the conventional text analytics pipeline yields an application that is more efficient, faster, and less computationally intensive than deep learning methods in terms of carbon footprint, with only a minor compromise in accuracy. The experimental results also exhibit a manifold reduction in time complexity and an improvement in model accuracy for the classical machine learning methods discussed further in this paper.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Asia > Middle East > Jordan (0.04)
- Asia > Bangladesh (0.04)
- (2 more...)
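The classical pipeline the abstract describes (TF-IDF weighting followed by IRLBA-style truncated SVD before classification) can be sketched roughly as follows. The abstract does not specify the CTF-IDF modification, so plain TF-IDF stands in here as an assumption, and the dimensionality-reduction step is only noted in a comment:

```python
import math

def tfidf(docs):
    """Plain TF-IDF as a stand-in; the paper's CTF-IDF variant is not
    detailed in the abstract, so this is the classical baseline."""
    vocab = sorted({w for d in docs for w in d.split()})
    idx = {w: i for i, w in enumerate(vocab)}
    rows = []
    for d in docs:
        words = d.split()
        row = [0.0] * len(vocab)
        for w in words:
            row[idx[w]] += 1.0 / len(words)  # normalized term frequency
        rows.append(row)
    # document frequency and inverse document frequency per term
    df = [sum(1 for r in rows if r[j] > 0) for j in range(len(vocab))]
    idf = [math.log(len(docs) / d_j) for d_j in df]
    return [[tf_ij * idf[j] for j, tf_ij in enumerate(r)] for r in rows], vocab

docs = ["spam offer now", "meeting agenda now", "spam spam offer"]
X, vocab = tfidf(docs)
# In the paper's pipeline, X would next be reduced to its leading singular
# directions with IRLBA (an iterative truncated SVD) before a classical
# classifier is trained on the low-dimensional representation.
```

The resulting matrix is sparse and high-dimensional, which is exactly why a truncated factorization such as IRLBA pays off: it computes only the leading singular vectors rather than the full decomposition.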
VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents
Lee, Sam Yu-Te, Ji, Chenyang, Wen, Shicheng, Huang, Lifu, Liu, Dongyu, Ma, Kwan-Liu
Text analytics has traditionally required specialized knowledge in Natural Language Processing (NLP) or text analysis, which presents a barrier for entry-level analysts. Recent advances in large language models (LLMs) have changed the landscape of NLP by enabling more accessible and automated text analysis (e.g., topic detection, summarization, information extraction, etc.). We introduce VIDEE, a system that supports entry-level data analysts in conducting advanced text analytics with intelligent agents. VIDEE instantiates a human-agent collaboration workflow consisting of three stages: (1) Decomposition, which incorporates a human-in-the-loop Monte-Carlo Tree Search algorithm to support generative reasoning with human feedback, (2) Execution, which generates an executable text analytics pipeline, and (3) Evaluation, which integrates LLM-based evaluation and visualizations to support user validation of execution results. We conduct two quantitative experiments to evaluate VIDEE's effectiveness and analyze common agent errors. A user study involving participants with varying levels of NLP and text analytics experience -- from none to expert -- demonstrates the system's usability and reveals distinct user behavior patterns. The findings identify design implications for human-agent collaboration, validate the practical utility of VIDEE for non-expert users, and inform future improvements to intelligent text analytics systems.
- North America > United States > California (0.04)
- North America > Canada > Ontario > Toronto (0.04)
- Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
- (2 more...)
- Workflow (1.00)
- Questionnaire & Opinion Survey (1.00)
- Research Report > New Finding (0.67)
- Personal > Interview (0.46)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Generative AI Takes a Statistics Exam: A Comparison of Performance between ChatGPT3.5, ChatGPT4, and ChatGPT4o-mini
Many believe that use of generative AI as a private tutor has the potential to shrink access and achievement gaps between students and schools with abundant resources and those with fewer resources. Shrinking the gap is possible only if paid and free versions of the platforms perform with the same accuracy. In this experiment, we investigate the performance of GPT versions 3.5, 4.0, and 4o-mini on the same 16-question statistics exam given to a class of first-year graduate students. While we do not advocate using any generative AI platform to complete an exam, the use of exam questions allows us to explore aspects of ChatGPT's responses to typical questions that students might encounter in a statistics course. Results on accuracy indicate that GPT3.5 would fail the exam, GPT4 would perform well, and GPT4o-mini would perform somewhere in between. While we acknowledge the existence of other generative AI/LLM platforms, our discussion concerns only ChatGPT because it is the most widely used platform on college campuses at this time. We further investigate differences among the AI platforms in their answers to each problem using methods developed for text analytics, such as reading level evaluation and topic modeling. Results indicate that GPT3.5 and GPT4o-mini are more similar to each other than either is to GPT4.
- North America > United States > Arkansas (0.04)
- Asia > Middle East > Jordan (0.04)
- South America > Paraguay > Asunción > Asunción (0.04)
- (5 more...)
- Research Report > New Finding (1.00)
- Instructional Material > Course Syllabus & Notes (0.93)
- Research Report > Experimental Study (0.69)
- Health & Medicine (1.00)
- Education > Educational Setting > Higher Education (1.00)
- Education > Curriculum > Subject-Specific Education (0.93)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)
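One of the text-analytics methods the abstract names, reading level evaluation, can be sketched with the standard Flesch Reading Ease formula, 206.835 - 1.015*(words/sentences) - 84.6*(syllables/words). The vowel-group syllable counter below is a common crude heuristic, not the paper's method:

```python
import re

def flesch_reading_ease(text):
    """Flesch Reading Ease score; higher means easier to read.
    Syllables are approximated by counting vowel groups per word."""
    sentences = max(1, len(re.findall(r"[.!?]+", text)))
    words = re.findall(r"[A-Za-z]+", text)
    syllables = sum(max(1, len(re.findall(r"[aeiouyAEIOUY]+", w)))
                    for w in words)
    return (206.835
            - 1.015 * (len(words) / sentences)
            - 84.6 * (syllables / len(words)))

score = flesch_reading_ease("The cat sat. The dog ran.")
```

Comparing such scores across the three models' answers is one way to quantify how the platforms differ in the complexity of their prose.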
H-COAL: Human Correction of AI-Generated Labels for Biomedical Named Entity Recognition
Duan, Xiaojing, Lalor, John P.
With the rapid advancement of machine learning models for NLP tasks, collecting high-fidelity labels from AI models is a realistic possibility. Firms now make AI available to customers via predictions as a service (PaaS). This includes PaaS products for healthcare. It is unclear whether these labels can be used for training a local model without expensive annotation checking by in-house experts. In this work, we propose a new framework for Human Correction of AI-Generated Labels (H-COAL). By ranking AI-generated outputs, one can selectively correct labels and approach gold standard performance (100% human labeling) with significantly less human effort. We show that correcting 5% of labels can close the AI-human performance gap by up to 64% relative improvement, and correcting 20% of labels can close the performance gap by up to 86% relative improvement.
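The core H-COAL idea, ranking AI-generated outputs and routing only a small fraction to human correction, can be sketched as below. Using raw model confidence as the ranking signal is an assumption for illustration; the paper may rank outputs differently:

```python
def h_coal_correct(ai_labels, confidences, oracle, budget=0.05):
    """Sketch of selective label correction: flag the least-confident
    fraction (the budget) of AI-generated labels and replace them with
    human-supplied gold labels via the oracle callback."""
    n = len(ai_labels)
    k = int(n * budget)
    # indices of the k least-confident predictions
    flagged = sorted(range(n), key=lambda i: confidences[i])[:k]
    corrected = list(ai_labels)
    for i in flagged:
        corrected[i] = oracle(i)  # human annotator supplies the gold label
    return corrected

# Toy biomedical NER example with hypothetical labels and confidences
ai = ["O", "B-DRUG", "O", "B-DISEASE"]
conf = [0.99, 0.42, 0.97, 0.95]
gold = ["O", "I-DRUG", "O", "B-DISEASE"]
fixed = h_coal_correct(ai, conf, lambda i: gold[i], budget=0.25)
```

With a 25% budget here, only the single least-confident label (0.42) is sent to the oracle; the other three AI labels are kept as-is, which is how the framework approaches gold-standard performance with far less human effort.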
Application of Text Analytics in Public Service Co-Creation: Literature Review and Research Framework
Rizun, Nina, Revina, Aleksandra, Edelmann, Noella
The public sector faces several challenges that need to be addressed, such as numerous external and internal demands for change and citizens' dissatisfaction and frustration with public sector organizations. An alternative to the traditional top-down development of public services is co-creation, which promotes collaboration between stakeholders with the aim of creating better public services and achieving public values. At the same time, data analytics has been fuelled by the availability of immense amounts of textual data. Whilst both co-creation and Text Analytics (TA) have been used in the private sector, we study existing work on the application of TA techniques to text data in support of public service co-creation. We systematically review 75 of 979 papers that focus directly or indirectly on the application of TA in the context of public service development. In our review, we analyze the TA techniques, the public services they support, the public value outcomes, and the co-creation phase in which they are used. Our findings indicate that the implementation of TA for co-creation is still in its early stages and thus still limited. Our research framework promotes the concept and stimulates a stronger role for TA techniques in supporting public sector organisations and their use of the co-creation process. From the standpoint of policy-makers and public administration managers, our findings and the proposed research framework can serve as a guideline for developing a strategy for designing co-created, user-centred public services.
- Europe > Denmark (0.14)
- Asia > China (0.14)
- North America > United States > New York > New York County > New York City (0.04)
- (12 more...)
- Research Report > New Finding (1.00)
- Overview (1.00)
- Government > E-government (0.47)
- Government > Regional Government > Europe Government (0.46)
5 Completely FREE Natural Language Processing Courses
Text Analytics 2: Visualizing Natural Language Processing is a practical course consisting of 3 modules. In the first module, you will learn about Text Analytics and Human Cognition, Measuring Linguistic Similarity, Topic Modelling, and more. The next lesson covers how to visualize text analytics. The last section of the course covers how to apply text analytics to new fields.
Expanding AI technology for unstructured biomedical text beyond English
The health industry is embracing the power of big data, cloud computing, and clinical analytics, harnessing data to deliver insights that can improve care and efficiency. Still, unstructured text remains a challenge--made even more complex by barriers of language. Doctors' notes and other unstructured text are often left unreferenced, are hard to parse and learn from, and are difficult to extract insights from, which leads to missed opportunities for diagnosis and better care. Microsoft recognizes the need to enable healthcare organizations worldwide to gather insights from this data--for better, faster, and more personalized care, and to improve health equity. With Text Analytics for Health, a part of Azure Cognitive Services, healthcare organizations around the world can now extract meaningful insights from unstructured text in seven languages and process it in a way that enables clinical decision support like never before.
- South America > Brazil (0.06)
- Asia > Middle East > Israel (0.05)
- North America > United States (0.05)
Top 19 Data Science Interview Questions for Beginners - DataScienceCentral.com
Job interviews make everyone nervous, but that is what they are designed to do: they are the most common way to assess a candidate's presence of mind and ability to remain calm and composed in a tense situation. To ace the interview, you need in-depth knowledge of the role you are interviewing for and what is expected of you. Presence of mind and strong subject knowledge matter even more when you are preparing for a Data Scientist interview, as it is certain to test your capabilities.
Azure Bicep: Deploy a Cognitive Services container image for Text Analytics.
This article reviews how to use Azure Bicep to deploy a Cognitive Services resource and an Azure Container Instances resource to create a container image that can be used for text analytics. Before moving forward, take a moment to read the article below, which explains the architecture and objectives in detail. Let's analyze the Bicep template. Create a new file in your working directory and name it 'main.bicep'. Note that we declare two resources: the Azure Cognitive Services resource and the Azure Container Instance resource.
Blueprints for Text Analytics Using Python: Machine Learning-Based Solutions for Common Real World (NLP) Applications
Albrecht, Jens, Ramachandran, Sidharth, Winkler, Christian
This book is intended to support data scientists and developers so they can quickly enter the area of text analytics and natural language processing. Thus, we put the focus on developing practical solutions that can serve as blueprints in your daily business. A blueprint, in our definition, is a best-practice solution for a common problem. It is a template that you can easily copy and adapt for reuse. For these blueprints we use production-ready Python frameworks for data analysis, natural language processing, and machine learning.